ENHANCEMENT OF DISCRIMINATIVE CAPABILITIES OF HMM BASED RECOGNIZER THROUGH MODIFICATION OF VITERBI A - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
نویسنده
چکیده
The algorithm proposed in this paper integrates the concepts of variable frame rate and discriminative analysis based on Tanimoto ratio to modify the conventional Viterbi algorithm, in such a way that the steady or stationary signal is compressed, while transitional or non-stationary signal is emphasized through the frame-by-frame searching process. The usefulness of each frame is decided entirely within the Viterbi process and needs not to be the same for different models. To evaluate this algorithm, we tested a speech database of 9 highly confusable E-set English letters. With 5 state and 6 mixture components, the conventional HMM baseline system only delivered the recognition accuracy of 73.9%. In the preliminary experiment using the algorithm proposed in this paper, the recognition accuracy was increased to 8:2.5%.
منابع مشابه
MICROSOFT WINDOWS HIGHLY INTELLIGENT SPEECH RECOGNIZER: WHISPER - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Since January 1993, we have been working to refine and extend Sphinx-I1 technologies in order to develop practical speech recognition at Microsoft. The result of that work has been the Whisper (Windows Highly Intelligent Speech Recognizer). Whisper represents significantly improved recognition efficiency, usability, and accuracy, when compared with the Sphinx-I1 system. In addition Whisper offe...
متن کاملSequential homotopy-based computation of multiple solutions to nonlinear equations
IEEE Intl. Conf. Acoustics, Speech & Signal Processing (ICASSP) May 1995 Homotopy methods have achieved significant success in solving systems of nonlinear equations for which the number of solutions are known and the homotopy paths are bounded. We present a twostage homotopyprocess which does not require a-priori knowledge of the number of solutions to a system of nonlinear equations. This app...
متن کاملMarkov model-based phoneme class partitioning for improved constrained iterative speech enhancement
171 A. Benyassine and H. Abut, “Mixture excitations and finite-state CELP speech coders,” in Proc. IEEE ICASSP., Mar. 1992, pp. 1-345-1-348. P. Krmn and B. S. Atal, “Strategies for improving the performance of CELP coders at low bit rates,” in Proc. IEEE ICASSP, Apr. 1988, pp. 151-154. P. moon and B. S. Atal, “On the use of pitch predictors with high temporal resolution,” IEEE Truns. Acoust., S...
متن کاملEXPERIMENTAL EVALUATION OF SEGMENTAL HMMS - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
The aim of the research described in this paper is to overcome important speech-modeling limitations of conventional hidden Markov models (HMMs), by developing a dynamic segmental HMM which models the changing pattern of speech over the duration of some phoneme-type unit. As a first step towards this goal, a static segmental HMM [3] has been implemented and tested, This model reduces the influe...
متن کاملDSP-BASED MOBILE AND SATELLITE RECEIVERS, FROM ALGORITHM TO IMPLEIMENTATION: A DESIGN COURSE AT AACH - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Profound knowledge of the interaction between algorithms and digital signal processor (DSP) architectures is required to be able to efficiently design complex communications equipment. Whereas both algorithms and architecture find treatment in many courses individually, education focusing on design methodology for DSP implementation is found to be rare. This contribution describes a concept and...
متن کامل